Poizon Scraper: China Sneaker Catalog
Pricing
from $4.99 / 1,000 product scrapeds
Poizon Scraper: China Sneaker Catalog
Scrape Poizon (Dewu) for sneaker, apparel, and luxury product catalogs. Extract brand, retail price, SKU, images, and authentication status. Uses Wayback Machine snapshots for reliable, unblocked data. 350M+ items, 1,400+ brands.
Pricing
from $4.99 / 1,000 product scrapeds
Rating
0.0
(0)
Developer
GetAScraper
Maintained by CommunityActor stats
0
Bookmarked
2
Total users
1
Monthly active users
2 hours ago
Last modified
Categories
Share
What does Poizon Scraper do?
Get live Poizon (Dewu) sneaker, apparel, and luxury listings: brand, retail price, SKU, images, and authentication status, ready for your spreadsheet in under 60 seconds. Pulls from the public Internet Archive, so no login or proxy is required. Ideal for market research, sneaker resale intelligence, and AI training corpora.
Why use Poizon Scraper?
- I am a sneaker reseller comparing Poizon retail prices against StockX and GOAT ask prices to spot underpriced China inventory before my competitors do.
- I am a brand-protection analyst at Nike or adidas watching new colorways, regional price gaps, and unauthorized reseller activity on the world's largest authentication marketplace.
- I am an AI engineer training a sneaker recognition model on the single largest public dataset of authentication-verified luxury resale activity (350M+ items, 1,400+ brands).
- I am a market-research analyst building weekly reports on streetwear pricing trends across China, with images, brand metadata, and SKUs in a clean JSON feed.
- I am a luxury reseller building a price-drop tracker for Poizon-curated drops so my Shopify store can resell at the right margin.
How to use Poizon Scraper
- Create a free Apify account (no credit card needed for the free trial).
- Open the Poizon Scraper in the Apify Console.
- Pick a category (Sneakers, Shoes, Apparel, etc.) and choose a data source.
- Set the Max Items limit and click Start.
- Download the dataset as JSON, CSV, Excel, or JSONL.
Input
| Field | Type | Required | Description |
|---|---|---|---|
startUrls | array of URLs | No | Direct Poizon category, search, or curated product URLs to scrape. When set, overrides the category and keyword fields. |
maxItems | integer | No | Cap on records returned. Free plan caps at 10. Paid plans scale up to 1,000,000. Defaults to 100. |
category | enum | No | Pick a Poizon category to auto-generate start URLs. Ignored when startUrls is provided. |
brand | string | No | Restrict the scrape to a single brand (e.g. Nike, adidas originals, Jordan). Leave blank for all brands. |
sortBy | enum | No | Result ordering: recommended (default), newest, price_low, price_high, or most_sold. |
minPriceUsd | integer | No | Only include products priced at or above this USD value. 0 for no minimum. |
maxPriceUsd | integer | No | Only include products priced at or below this USD value. 0 for no maximum. |
dataSource | enum | No | Where to fetch data from: wayback (recomendado, reliable), live (tries storefront first), or auto (live then falls back). |
includeImages | boolean | No | Include product image URLs in the output. Disable to slim down the dataset. Defaults to true. |
includeSku | boolean | No | Include the article number / SKU for each product. Defaults to true. |
includeSalesStats | boolean | No | Include total units sold and the sale tag (e.g. 1.1K+ Sold). Defaults to true. |
maxConcurrency | integer | No | Maximum number of parallel requests (1-20). Defaults to 5. |
debugLogging | boolean | No | Turn on verbose per-step logging (CDX queries, HTTP responses, headers) on Apify. Useful for diagnosing run issues. Defaults to false. |
proxyConfiguration | object | No | Proxy settings. Not needed for the recommended Internet Archive source. Required for live mode. |
Output
A dataset of Poizon catalog products with brand metadata, retail pricing, images, and authentication status. Free users get 50 results per run, paid plans scale to 1,000,000.
{"product_id": "1237613","url": "https://web.archive.org/web/20240725014341id_/https://www.poizon.com/category/sneakers","name": "Nike Dunk Low 'Panda Black White'","brand": "Nike","category": "sneakers","subcategory": "Lifestyle","retail_price_usd": 83,"retail_price_raw": "$83.00","currency": "USD","sku": "DD1391-100","authentication_status": true,"images": ["https://cdn-img.poizonapp.com/pro-img/origin-img/20240718/fdb1b89f4ceb470b9c2ed4c050ed204b.jpg"],"release_date": "2021-08-17","total_sold": 1395026,"sale_tag": "380 Sold","region": "US","source": "poizon-wayback","snapshot_date": "2024-07-25","wayback_url": "https://web.archive.org/web/20240725014341id_/https://www.poizon.com/category/sneakers","scraped_at": "2026-06-14T09:57:49.225Z"}
You can download the dataset in various formats such as JSON, HTML, CSV, or Excel.
Data table
| Field | Type | Description |
|---|---|---|
product_id | string | Poizon SPU identifier extracted from the product URL. |
url | string | Canonical URL (live or Internet Archive). |
name | string | Product name (HTML entities decoded). |
brand | string | Brand name (Nike, adidas originals, Jordan, etc.). |
category | string | Top-level category (sneakers, shoes, apparel, etc.). |
subcategory | string | Subcategory (Lifestyle, Chunky Sneakers, etc.). |
retail_price_usd | number | Retail price in USD. |
retail_price_raw | string | Formatted price string ($83.00). |
currency | string | Currency code, always USD for the international storefront. |
sku | string | Stock keeping unit (article number). |
authentication_status | boolean | Always true. Every Poizon item is authenticated. |
images | array | Product image URLs. |
release_date | string | ISO date string for the product release. |
total_sold | number | Cumulative units sold on Poizon. |
sale_tag | string | Sales tag (1.1K+ Sold). |
region | string | Region the price applies to (US, etc.). |
source | string | poizon-wayback (Internet Archive) or poizon-live (storefront). |
snapshot_date | string | Date of the Internet Archive snapshot, for wayback source. |
wayback_url | string | Full Internet Archive URL, for wayback source. |
scraped_at | string | ISO 8601 timestamp of when the row was pushed. |
Pricing
The Actor costs $4.99 per 1,000 results. A typical run extracting 100 products costs about $0.50. No proxy bandwidth is needed for the recommended Internet Archive source, so costs stay at the compute floor.
Tips or advanced options
- Use the Internet Archive source by default. It is the most reliable. The wayback setting queries the Internet Archive for the latest 2024-2025 snapshot and extracts real product data.
- Filter by brand and price. Set Brand (e.g. Nike, adidas originals, Jordan) and Min/Max Price (USD) to narrow the run. Filters apply after extraction, so combined with a category they cut the dataset to exactly what you need.
- Turn on debug logging. Enable
debugLogging: trueto see verbose per-step logs (CDX queries, HTTP responses, headers) on Apify. Helpful for diagnosing any run issue. - Slim down the dataset. Disable Include Product Images, Include SKU, or Include Sales Stats under the Enrichment section to keep rows small.
- Live mode caveats. The live storefront setting tries Poizon directly. Because the storefront filters most cloud IPs, live mode usually returns 0 products. Use it only with a residential proxy.
- Internet Archive snapshots are immutable. Each snapshot returns the same data on every run. Great for reproducible backtests and price history.
- What is coming next. A future version will scrape a single SKU across historical Internet Archive snapshots to track price evolution. Subscriptions at $50 per month are planned.
FAQ, disclaimers, and support
Is the Poizon scraper legal and compliant with Poizon terms of service?
Yes. The Poizon API scraper reads from the public Internet Archive Wayback Machine, which itself crawls only public web pages. The data is the same that any logged-out human could see in a browser. You are responsible for compliance with Poizon terms of service and applicable law when reusing the data.
Does the Poizon scraper get blocked by the storefront?
No. The recommended data source uses Internet Archive snapshots, which are unblocked and reliable. The live Poizon.com storefront filters most cloud IPs, but the Wayback Machine route never hits the live storefront, so the Poizon sneaker catalog and streetwear catalog scraper runs unblocked on every attempt.
How fresh is the scraped Poizon data?
Internet Archive snapshots are from 2024-2025. Poizon's sneaker and apparel catalog changes incrementally, so the data may be 1-2 years behind live. For real-time data, future versions will add other sources. The Poizon Dewu scraper always returns the same data on every run for a given snapshot, which makes it ideal for backtests and reproducible research.
Which Poizon categories and brands does the scraper support?
All six Poizon categories (Sneakers, Shoes, Apparel, Accessories, Bags, Toys) plus any custom URL you paste into Start URLs. You can filter the Poizon sneaker catalog by brand (Nike, adidas originals, Jordan, New Balance, ASICS, etc.) and by USD price range. The output includes the SKU, retail price, images, release date, total units sold, and authentication status for every product the source snapshot exposes.
Disclaimer: This Actor accesses public Internet Archive data only. It does not bypass authentication, paywalls, or anti-bot systems on the live Poizon site. Dewu is the registered trademark of Shanghai Dewu Information Group Co., Ltd. The Actor is not affiliated with, endorsed by, or sponsored by Dewu or Poizon.